Word Formation Is Aware of Morpheme Family Size
نویسندگان
چکیده
Words are built from smaller meaning bearing parts, called morphemes. As one word can contain multiple morphemes, one morpheme can be present in different words. The number of distinct words a morpheme can be found in is its family size. Here we used Birth-Death-Innovation Models (BDIMs) to analyze the distribution of morpheme family sizes in English and German vocabulary over the last 200 years. Rather than just fitting to a probability distribution, these mechanistic models allow for the direct interpretation of identified parameters. Despite the complexity of language change, we indeed found that a specific variant of this pure stochastic model, the second order linear balanced BDIM, significantly fitted the observed distributions. In this model, birth and death rates are increased for smaller morpheme families. This finding indicates an influence of morpheme family sizes on vocabulary changes. This could be an effect of word formation, perception or both. On a more general level, we give an example on how mechanistic models can enable the identification of statistical trends in language change usually hidden by cultural influences.
منابع مشابه
Early and Late Effects of Morphological Decomposition: Brain Correlates of Family Size Effects on Complex Words and Pseudowords
In three ERP experiments, morphology-based decomposition of words and pseudowords was explored in Spanish. Subjects were asked to perform a lexical decision task on morphologically simple (e.g. ‘sun’) and complex (e.g. ‘allerg+ic’, ‘allerg+ist’) word strings, while family size for both lexemes/stems (S-FS) and morphemes/suffixes (M-FS) was varied. In Experiment I, earlier results by Schreuder &...
متن کاملA Hybrid Morpheme-Word Representation for Machine Translation of Morphologically Rich Languages
We propose a language-independent approach for improving statistical machine translation for morphologically rich languages using a hybrid morpheme-word representation where the basic unit of translation is the morpheme, but word boundaries are respected at all stages of the translation process. Our model extends the classic phrase-based model by means of (1) word boundary-aware morpheme-level ...
متن کاملA TRAFFIC-AWARE MECHANISM TO ADJUST CONTENTION WINDOW IN 802.11E WIRELESS LANS
<span style="color: #000000; font-family: Tahoma, sans-serif; font-size: 13px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: auto; text-align: -webkit-left; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; display: inline !important; float: none; ba...
متن کاملA TRAFFIC-AWARE MECHANISM TO ADJUST CONTENTION WINDOW IN 802.11E WIRELESS LANS
<span style="color: #000000; font-family: Tahoma, sans-serif; font-size: 13px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: auto; text-align: -webkit-left; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; display: inline !important; float: none; ba...
متن کاملDerivational morphology and base morpheme frequency
0749-596X/$ see front matter 2010 Published b doi:10.1016/j.jml.2009.01.003 * Corresponding author. Fax: +44 (0)1223 766452 E-mail addresses: [email protected] (M.A. Ford). Morpheme frequency effects for derived words (e.g. an influence of the frequency of the base ‘‘dark” on responses to ‘‘darkness”) have been interpreted as evidence of morphemic representation. However, it has been s...
متن کامل